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(54) Mobile terminal and base station in a packet radio services network 



(57) There is disclosed a mobile terminal for com- 
municating with a base station in a packet radio serv- 
ices network. The terminal has a processor for 
determining one of a plurality of channels for communi- 
cation between the mobile terminal and the base sta- 
tion; for digitally coding speech to provide speech 
information; for assembling speech information into 
speech packets; and for generating channel allocation 
requests for a channel in which to send speech packets. 
A radio transmitter is provided for transmitting the 
requests" and the packets to a 'base station~ih fhe "net- 



work. A radio receiver receives identities of channels 
allocated by the base station for the mobile terminal to 
transmit on. The processor is responsive to each 
received channel allocation to determine that packets 
are sent on the allocated channel. 

in the GPRS since a channel is released when 
there is no packet to transmit, higher traffic levels can be 
obtained using the same number of radio channels. 



FIB. 1 



4 



DUPLEX 



8- 



TX 



RX 



-10 



A/D 



CODEC 



VOICE ACTIVITY 
DETECTOR 



CHANNEL 
ALLOCATION 
REQUEST 



ASSEMBLE 
PACKETS 



DETERMINE 
RADIO CH 



14- 



MIKE 



16 
•18 

•20 



12 



24 



DUPLEX 



28- 



_L 



TX 



RX 



-26 
•32 



± 



ALLOCATE SPEECH CH 



PROCESS ALL PACKETS IN 
TALKSPURT TO SAME 
DESTINATION ADDRESS 



RECOVER NETWORK & 
TRANSPORT LAYER HEADERS 
FROM FRAME HEADERS 



CORRECT ERRORS IN 
FRAME HEADERS 



-22 
-30 



BASE STATION 



MOBILE 



Primed bv Xerox fUK) Business Services 



EP 0 984 641 A2 



Description 

[0001 ] This invention relates to packet radio services networks. 

[0002] Standards are being defined for a general packet radio services network (GPRS). 
5 [0003] The invention is based on the recognition that if suitably designed a packet services network could carry 
speech. 

[0004] To this end, in accordance with the invention there is provided a mobile terminal for communicating with a base 
station in a packet radio services network, said terminal including a processor for determining one of a plurality of chan- 
nels for communication between the mobile terminal and the base station; for digitally coding speech to provide speech 

10 information; for assembling speech information into speech packets; and for generating channel allocation requests for 
a channel in which to send speech packets; a radio transmitter for transmitting the requests and the packets to a base 
station in the network; and a radio receiver for receiving identities of channels allocated by the base station for the 
mobile terminal to transmit on, said processor being responsive to each received channel allocation to determine that 
packets are sent on the allocated channel. 

is [0005] In the GPRS since a channel is released when there is no packet to transmit, higher traffic levels can be 
obtained using the same number of radio channels. 

[0006] Preferably, if a request to send is not granted, the processor is arranged to discard speech information until a 
further request is granted. As speech is highly time sensitive, it is better to discard the information than to send the infor- 
mation delayed. The discard produces clipping which, as long as it is not too frequent, is tolerable by the user. 
20 [0007] The processor is preferably arranged so that when a request to send is not granted, a further request is 
delayed by a predetermined period. 

[0008] The delay is preferably increased if successive requests are not granted. 
[0009] Following a predetermined maximum delay, the delay is reduced. 

[0010] The processor is preferably arranged to implement a layered protocol in which each packet is given a header 

25 in a subnetwork dependent convergence protocol layer (SDNDCP). 

[0011] Because of the time sensitive nature of speech the header is preferably a RTP/VDP/IP header. 
[0012] The mobile terminal preferably includes a voice activity detector, and the processor is preferably responsive to 
detection of voice activity by the voice activity detector, to generate a request for a channel allocation in which to send 
voice packets, and on receipt of a channel identity, to send the an address header uncompressed on that channel once 

30 and subsequently to send packets with compressed headers which do not contain the destination address on the iden- 
tified channel, until the voice activity detector detects no voice activity. 

[0013] The processor is preferably arranged to construct packets of an equal number n of frames, the processor being 
further arranged to implement a logical link layer protocol (LLC) which adds its own LLC header information comprising 
a service access point identifier defining speech service to each packet, and to divide the total LLC plus SNDCP header 
35 into n parts of equal length and to place one header part before each frame in the packet. This provides that every frame 
in the packet has the same format and allows a common protection strategy to be applied to each frame. The header 
information can be given an error correcting code. Speech is more error tolerant, however. More important parts of the 
speech information can be coded in order to identify that there is an error, in which case the frame is discarded. Less 
important parts of the speech information can be left unprotected. 
—,-40- ~ |00-14]~ -Thus,- in the-physieaUayer, in each frame, the- header-and the-most important-bits speech information-are pref- 
erably coded using a convolutional code, and a subset of the important bits of the speech information are coded using 
a cyclic redundancy check. 

[0015] The invention also extends to a base station including a radio receiver for receiving requests from mobile sta- 
tions to send data packets and requests to send speech packets and operable on a plurality of channels to receive data 
45 packets and speech packets; a processor for reserving a predetermined number of said channels for receiving coded 
speech packets, and for allocating nominating a free one of said predetermined number responsive to a request chan- 
nel allocation request in which to send a speech packet; and a transmitter for transmitting the allocated channel to the 
mobile station. 

[001 6] By dynamically managing the number of channels reserved for speech, optimum service can be given to both 

50 speech services and to data services given changing respective demands. 

[0017] The invention also extends to a base station including a radio receiver for receiving requests from mobile sta- 
tions to send data packets and requests to send speech packets and operable on a plurality of channels to receive data 
packets and speech packets; a processor for nominating channels for a mobile station to send speech packets and for 
processing packets in a talk spurt comprising a single destination address header followed by a plurality of speech 

55 packets not containing a destination address, for transmission over the network. 

[0018] The invention further extends to a base station including a radio receiver for receiving requests from mobile 
stations to send data packets and requests to send speech packets and operable on a plurality of channels to receive 
data packets and speech packets; a processor for implementing a protocol which recovers network and transport layer 
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headers and logical link layer headers for a packet, from equal parts of each frame in the packet. 
[001 9] The processor may be operative in each frame to correct errors in the header and the most subjectively impor- 
tant bits of the speech information only. 

[0020] One embodiment of the invention will now be described, by way of example, with reference to the accompa- 
5 nying drawings, in which: 

Figure 1 is a block diagram of a GPRS mobile terminal and base station embodying the invention; 

Figure 2 shows schematically the operation of RLC/MAC protocol; 

Figure 3 shows network layer protocol layers; 
10 Figure 4 shows the SNDCP model operation to support voice; 

Figure 5 shows the format of an LLC-PDU; 

Figure 6 shows the organisation of TDMA frames in GPRS; 

Figure 7 shows the GPRS TDMA multiframe structure; 

Figure 8 shows the partition of channels in a GPRS carrying speech; 
15 Figure 9 shows how source coded bits output from the codec are protected; 

Figure 10 shows the operation of each layer in the protocol; and 

Figure 1 1 show the frame structure prior to convolutional coding. 

[0021] Referring to the drawings, a mobile terminal 2 has an antenna 4 coupled to a duplexer 6. The duplexer 6 is 
20 coupled to a transmitter 8 and a receiver 10. Signals received by the receiver 1 0 are fed to a processor 1 2. Sound waves 
of speech are transduced to analog electrical signals by a microphone 14 and the analog signals are converted to digital 
by the processor which may have one or more central processing units (not shown). An analog to digital converter may 
be a self contained unit 1 6. The processor processes the digitised speech which is then coded by a parametric codec 
algorithm, e.g. EFR, to produce speech frames. A codec may be a self contained unit 18. 
25 [0022] A voice activity detector algorithm detects the presence of speech distinguished from silences. The voice activ- 
ity detector may be a self contained unit 20. When speech is detected, the processor assembles speech information 
output from the codec with network and transport layer headers into fixed length packets of two frames and sends a 
channel allocation request. 

[0023] As may be seen from the block diagram of Figure 2, if the channel allocation request is refused, a delay is intro- 
30 duced before a new request is sent and speech frames occurring during the delay are discarded. 

[0024] A base station 22 has an antenna 24 feeding a duplexer 26. A radio receiver 28 sends packets received from 
the mobile terminal 2, to a processor 30. Data for transmission to the mobile terminal 2 is sent to a radio transmitter 32 
coupled to the duplexer 26. 

[0025] Network layer protocols, illustrated in Figure 3, are intended to be capable of operating over services derived 
35 from a wide variety of subnetworks and data links. GPRS was designed from the outset to support several network layer 
protocols providing network transparency for the users of the service. Introduction of new network layer protocols to be 
transferred over GPRS was to be allowed without any changes to the GPRS network, a function carried out by the sub- 
network dependent convergence protocol (SNDCP). In addition, SNDCP carries out header and data compression, and 
multiplexing of data coming from different sources to be sent over the LLC layer. 
40 [0026] _ IP is used as the network protocol with RTP being used to provide suppojtjbrjhe. T^LtiJTl^tie^cning SU P~, 
plying tirnestamp information and packet sequencing. SNDCP currently only provides for TCP/IP and lP(v4) header 
compression by implementing the RFC1 144 compression algorithm. However, the SNDCP specifications also allow for 
additions to the list of supported compression protocols, according to the requirements of new applications and serv- 
ices. The present system employs the RTP/UDP/IP protocols which involve an overhead of 40 octets, corresponding to 
45 320 bits. 

[0027] Using packets of two frames length, it is necessary to support some form of compression for these transport 
and network layer headers. Indeed, if the CS-I channel coding scheme were to be used, the combined RTP/IUDP/IP 
headers would occupy the entire information payioads of two radio blocks, leaving no space for any speech information 
or logic link control (LLC) headers. 
so [0028] A high compression efficiency may be obtained by treating the IP/UDP and RTP headers together rather than 
separately. Although it is contrary to the ethos of layered architecture, crossing these protocol layer boundaries is 
appropriate because the same function is being applied across all layers. 

[0029] There are two main properties of the transmitted packets which are used to carry out header compression. 
The first factor-of~two reduction in data rate comes from the observation that half of the bytes in the headers remain 
55 constant over the life of the connection. An obvious example is the source and destination addresses and ports. The 
uncompressed header is sent once, during a connection-establishment phase. These fields are then deleted from the 
compressed headers that follow without any real loss of information. 

[0030] The remaining compression comes from differential coding on the changing fields to reduce their size. In par- 
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ticular, for RTP header compression, a big gain in efficiency comes from the observation that although several fields 
change in every packet, such as the sequence number and the timestamp, the difference from packet to packet is often 
constant, and therefore the second-order difference is zero. By making use of these properties, the massive combined 
RTP/UDP/IP header can be reduced to two bytes or three bytes, depending upon whether a header checksum is used. 
5 As at least part of the end-to-end link includes at least one mobile propagation path, which is by its very nature subject 
to error, it would be useful to include the header checksum in the scheme employed. Although it is not be used for error 
correction or frame retransmission schemes, it gives an indication that part of the header may be corrupted and to 
ignore the timing information for that particular packet. 

[0031] SNDCP also supports data compression by means of the V.42 bis data compression algorithm. However, as 
10 the application layer which sits above the SNDC layer already includes a lossy source coder in the form of a speech 
codec, there stands little to be gained by applying data compression by means of entropy coding, as most redundancy 
in the original information would have been already extracted. In addition, source coding modifies the speech coder bit 
patterns and makes it difficult to apply differential channel coding to the speech frame according to the subjective impor- 
tance of the different bit positions. 

15 [0032] Figure 4 shows the SNDCP model operation to support voice. Analysis of the Voice over GPRS delay budget 
showed that maximum payload efficiency can be achieved by encapsulating two speech frames into a single network 
packet. Increasing the number of speech frames accommodated by a single network packet brings about a proportional 
increase in the packet buffering delay, thereby* incr easing the maximum end-to-end delay threshold of 200 ms. 
[0033] The SNDCP layer therefore accepts the combined RTP/UDP/IP headers and the speech frames through two 

20 different service access points. Header compression is carried out, and the resulting header segmented into two sec- 
tions for addition to the two speech frames that is encapsulated into that particular packet. This system allows for the 
two radio link control (RLC) blocks containing the speech information to have exactly the same layout, and therefore use 
exactly the same channel coding scheme for both blocks. As the forward error correction is tailored to catering for the 
properties of a particular speech coder, it is important to ensure that each bit position within an radio block refers to the 

25 same bit position within a speech frame for all transmitted blocks. The first received speech frame belonging to a par- 
ticular network packet is forwarded directly to the lower layer without waiting for the second frame to arrive. 
[0034] The Logical Link Control layer operates above the RLC and BSSGP layers in the illustrated architecture to pro- 
vide highly reliable logical links between a mobile terminal and its serving GPRS support node (SGSN). Its main func- 
tions are designed towards supporting such a reliable link and they include sequence control of LLC frames across a 

30 logical link, the detection of transmission, format and operational errors on a logical link connection, the notification of 
unrecoverable errors and flow control. 

[0035] The operation of the LLC protocol can be better understood by examining the format of an LLC-PDU shown in 
Figure 5. 

[0036] As can be seen, the LLC frame header is divided into two main sections, the Address Field and the Control 

35 Field. In the Address field is the Service Access Point Identifier (SAP I). This represents a point at which LLC services 
can be accessed and provides a means by which the Quality of Service priority can be defined. As ten out of a possible 
sixteen different identifiers currently remain vacant in the specifications, a new SAPI can be defined for voice services, 
instructing the layers above, namely the SNDCP and the BSSGP about the priority required by voice packets over data 
traffic. The conventional control field contains two sub-fields, represented by N(S) and N(R), whose function it is to 

„ AO,- -determine-the^ositioa-of a particular- LLC frame within a sequence c&ftames ^nstrtutag^sw^ 

ever, this function is superfluous within the context of the Voice over GPRS system there is no segmenting of network- 
PDUs, as each N-PDU fits exactly into a single LLC-PDU. These fields are therefore be omitted within the context of 
transporting real-time voice packets, without any loss of functionality. Each LLC-PDU conventionally ends with a 24-bit 
long footer containing a frame check sequence. This enables the LLC layer to ensure that the LLC frame is free of errors 

45 (within the capabilities of the CRC check) before passing it on to the network layer at the SGSN for delivery through the 
backbone network. Should errors be found, it signals for retransmission by means of the RLC layer selective repeat 
request system. However, as repeat-request systems are not used in the present implementation, and as there already 
exists a cyclic redundancy check at the physical layer, the FCS field within the LLC-PDU is also omitted without affecting 
the functionality of the system when transporting speech services. Indeed, should this field be retained, it would be 

so merely ignored by the receiving process, as even if errors were to be detected, the process would still forward the 
packet, because as already described, coded speech has an inherent information corruption tolerance. 
[0037] The system therefore accepts the two segments of the SNDC-PDU containing the two speech frames which 
belong to the same network packet, and add the new, 8-bit LLC header containing the SAPI for voice services to the 
first arriving segment so that the two frames in the packet have headers of equal length. This is then forwarded to the 

55 RLC/MAC layer for immediate dispatch over the radio interface. When the peer LLC process at the BSS receives the 
first radio block containing information from that particular LLC-frame, it identifies that it contains speech information by 
examining the service access point identifier. This information instructs the process not to look for further header infor- 
mation after the first 8 bits, and not to expect any frame check sequence. The space freed by removing these fields is . 
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used to carry more user payload information. 

[0038] The RLC/MAC (medium access control) layer provides services for the transfer of upper iayer PDUs using a 
shared medium between multiple mobile stations and the network. This transfer may take place in the form of unac- 
knowledged operation or acknowledged operation, according to the nature of the service required. As its very name 

5 implies, the RLC/MAC protocol actually consists of two separate protocols with different functions. The Radio Link Con- 
trol Layer defines the procedures for segmentation and reassembly of LLC PDUs into RLC/MAC blocks and in RLC 
acknowledged mode of operation, for the Backward Error Correction (BEC) procedures enabling the selective retrans- 
mission of unsuccessfully delivered RLC/MAC blocks. When operating in the RLC acknowledged mode, the RLC layer 
preserves the order of higher layer PDUs provided to it On the other hand, the MAC (Medium Access Control) function 

io defines procedures that enable multiple mobile stations to share a common transmission medium which may consist of 
several physical channels. The GPRS MAC protocol allows for a single user to use more than one timeslot concurrently 
so as to increase throughput. In addition, the same timeslot may be multiplexed between up to eight users, so as to 
increase the number of users operating on a given set of system resources, in the Voice over GPRS system, there is 
no provision for mulitsiotting, and for the duration of time for which a channel is occupied by a single user transmitting 

is speech information, this channel is not multiplexed with any other users. In addition, no use is made of the retransmis- 
sion functions of the RLC layer 

[0039] GPRS shares the same radio interface as the GSM circuit-switched voice system. This means that each phys- 
ical frequency channel is divided into eight traffic channels by means of time-division multiplexing. Each timeslot within 
a TDMA frame can be dynamically allocated to GPRS services or GSM services according to the relative shifts in 
20 demands for the two services, with those channels allocated to packet data traffic being referred to as Packet Data 
Channels (PDCH). 

[0040] Each time division multiple acces (TDMA) frame lasts for 4.615ms and can accommodate eight PDCHs within 
its eight timesiots. The data to be transmitted by means of the Packet Data Traffic Channels (PDTCH) is segmented into 
units of 114 bits each, which are then encapsulated into radio bursts for insertion into a single TDMA timeslot which 
25 lasts for 576.8us This means that each RLC/MAC block consisting of 456 bits is segmented and interleaved into four 
consecutive radio bursts. 

[0041] GPRS multiplexing differs from that found GSM circuit-switched speech in the way the TDMA frames are 
organised into multiframes. Whereas GSM supports 26-frame and 51 -frame multiframes, in GPRS the TDMA frames 
are organised into 239.980 ms-long multiframes consisting of 52 frames. This organisation is divided into 12 radio~ 
30 blocks of four frames each, and four idle frames as shown in Figure 6. 

[0042] Referring to Figure 7, it should be noted that all 52 bursts in the picture belong to the same timeslot, and con- 
sequently to the same PDCH. In GPRS, several logical channels not found in GSM circuit-switched services are intro- 
duced and accommodated onto the GSM physical channels (PDCH). 

[0043] Referring to Figure 8, the Packet Data Traffic Channel, which is used to actually carry the speech information 
35 and the Packet Access Grant channel which is used for channel contention are mapped onto the PDCH by what is 
known as the 'Master-Slave' concept. In this system, at least one PDCH (or timeslot), acting as a master, accommo- 
dates user data and dedicated signalling, and packet common control channels that carry all necessary control signal- 
ling for initiating packet transfer. Such control signalling refers only to the access bursts on the packet random access 
channel (PRACH). All other PDCHs, acting as slaves are used for user data transfer and dedicated signalling only. 
,1S^~.J59^l^fPJ^^.^X^^^r channel is capable of bearing.both the. PRACH and a PDTCH.sio^ 

the physical channel between the two logical channels by means of a time-division multiplexing mechanism. Usually, 
the bulk of the physical resources of the Master timeslot are allocated to carrying user data, with one block every t 
blocks being dedicated to supporting random access attempts, where t is typically an integer with value 3, 4 or 5. In this 
way, whereas seven slave channels are entirely dedicated to supporting voice and data traffic, the eighth channel, 
45 which is usually time slot zero (TSO) is used both as a traffic channel, and as a random access medium for all terminals 
operating at that particular radio frequency channel. 

[0045] The present implementation for supporting voice services requires that the master channel be left for control 
signalling only and not be allowed to accept any PDTCHs. The reason for making such a reservation is the extra delay 
that sharing the master channel would impose on the average access time. If, for example, t was set to three, one radio 
so block out of every three available radio blocks in the master channel would be dedicated to supporting the PRACH. This 
means that a mobile terminal has to wait approximately 3 RLC blocks, equivalent to 55.3 ms until it is allowed to fire the 
next random access. This extra delay is clearly desirable for real-time voice services. 

[0046] The remaining seven slave channels are then divided into those Packet Data Traffic Channels dedicated to 
voice services and those dedicated to data services. The system does not allow a PDTCH to share voice and data serv- 
55 ices, as the delay requirements are radically different By allowing GPRS voice users access to channels dedicated to 
carrying voice services, better control can be made by the base station to ensure that the required Quality of Service 
over the radio link in terms of access delay and speech frame loss rate are met 

[0047] The operation of the RLC/MAC protocol is summarised in Figure 2. if a GPRS-terminal is voice-capable and 
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wishes to initiate a conversation over the GPRS network, it initiates a call-setup procedure. In this process, that the 
Base Station monitors the current load in the cell of operation of the user and determine if it can support another voice 
user. If it can, the base station informs the mobile terminal that it has been admitted to the network and may initiate 
transmission of voice packets. The mobile terminal then goes into an idle mode where it waits for an indication from the 

5 terminal's voice activity detector (VAD) that a speech activity has been detected and a talkspurt has begun. A random 
access burst is sent over the PRACH, and it waits for a reply from the Base Station over the PAGCH. If channel 
resources are available, it indicates to the MS that it has allocated a single channel for the transmission of the talkspurt. 
In order not to be any more channel inefficient than the equivalent GSM circuit-switched voice services, multi-slotting is 
not enabled for the transmission of voice services over GPRS. This means that a single talkspurt may be transmitted in 

10 a single timeslot only, and the base station does not allocate more than this single PDTCH for this purpose. The mobile 
terminal then proceeds to transmit all the RLC/MAC blocks belonging to that particular talkspurt. On the onset of the 
following silence period, the mobile station stops transmitting RLC blocks, thereby indicating to the BS that it is releasing 
the channel. This means that channel contention occurs at the beginning of each talkspurt only, and once a terminal 
has acquired use of a traffic channel, it only relinquishes it when all the LLC frames corresponding to that particular talk- 

15 spurt have been transmitted. The Base Station then allocates this channel to the pool of channels available for the 
transmission of voice services. If, however, there are no available PDTCHs allocated for the support of voice services, 
the BS informs the MS of the situation. The mobile station then enters a random exponential backoff period, at the com- 
pietion qf which it reattempts to access the channel. 

[0048] As real-time speech is time-sensitive information, all the RLC/MAC blocks corresponding to speech frames 

20 generated during the backoff, save the most current one are discarded. For every failure to access the channel, a coun- 
ter is Incremented and used to determine the duration of the backoff period. If however a collision occurs between 
access bursts in the same timeframe on the PRACH, the Base Station is not able to determine which MS initiated the 
request, and so is unable to respond. The mobile terminal notices that such a collision has occurred by employing a 
timer set to a value slightly longer than the average response time of the base station. Should this value expire, the 

25 mobile station enters the backoff period in the same way as when channel access has been denied. 

[0049] Both data terminals as well as voice terminals implement an exponential backoff algorithm, where after an 
unsuccessful access attempt, a uniform distributed random number w elO^^] is drawn, where n is the number of 
access attempts. For data terminals, the next random attempt is tried after a waiting time of w*J8.5ms, while for voice 
terminals, multipliers of 4.615ms and 8.5ms respectively are used. 

30 [0050] As real-time conversational voice is a delay-critical service it is important that the medium access algorithm is 
geared towards keeping the access delay of the system to a minimum. A uniform distributed random number is selected 
from the range [0,2 n+r 1] The two parameters which when varied can alter the delay performance of the system are the 
maximum increment allowed, n, and a constant by which the random variable is multiplied in order to obtain the waiting 
time. Experiments have been carried out with an unbounded exponential backoff, meaning that no limit was set to the 

35 value of the increment counter, and with a time multiplier of one radio block, corresponding to 1 8.5ms. The experiments 
however showed that system performance could be improved by fixing n to an upper limit of 5, and reducing the time 
multiplier to the duration of a single TDMA frame, i.e. 4.615ms. 

[0051] The physical RF transmits information bits over the radio path and deals with issues such as the carrier fre- 
quency characteristics and GSM radio channel structures, the modulation of transmitted waveforms and the transmitter 
^d-o^ ^and,r-eceiver GharaGteristics and their respective perfe^a^ce^equirementSv-GPRS-shares-all-these specifications with — - 
the GSM speech standards, and as a consequence can make use of much of the current GSM radio infrastructure. 
[0052] The Physical Link Layer operates above the physical RF layer to provide a physical channel between the 
mobile terminal and the Base Station Subsystem (BSS). One of its main responsibilities is forward error correction cod- 
ing (FEC), which allows for the detection and correction of transmitted code words and the indication of uncorrectable 
45 code words. In addition, the physical link layer carries out rectangular interleaving of radio blocks over four bursts in con- 
secutive TDMA frames, and procedures for detecting physical link congestion. 

[0053] GPRS currently supports four channel coding schemes, ranging from a half-rate convoiutionai scheme (CS-I) 
to a scheme which provides for no channel coding (CS-4). 

[0054] Schemes CS-2 and CS-3 are punctured versions of the CS-l scheme, giving resulting code rates of approxi- 

50 mately 2/3 and 3/4 respectively 

[0055] When deciding which of the currently available coding schemes to use to protect coded speech, it is desirable 
to examine the speech coding techniques used to protect speech in circuit-switched GSM systems. As an example, we 
prefer to use the new GSM Enhanced Full Rate Coder (EFR). This is an Algebraic Codebook Excited Linear Predictive 
(ACELP) coder. It is essentially a parametric, rather than a waveform coder. This means that the speech is represented 

55 by a number of parameters describing, amongst other things, the pitch period of the speech, a number of LPC. coeffi- 
cients representing the waveform shaping effects that occur in the vocal tract and a description of the excitation that is 
generated by the speaker's vocal chords. Though these parameters are generated so as to maximise the coder effi- 
ciency by bringing to a minimum the redundancy present in the information being transmitted, not all parameters have 



6 



EP 0 984 641 A2 



an equal effect on the perceptible audio quality of the speech. This means, that when in error, some parameters, and 
consequently bits, cause a greater distortion in speech quality than other bits. This phenomenon is exploited when pro- 
tecting the source coded bits by means of FEC. 

[0056] Figure 9 shows how the 244 source coded bits output from the EW codec are protected to varying degrees 

5 according to the subjective importance of the bits. Type 1 bits are protected using a half-rate coder identical to that used 
by the CS-1 scheme, while a subset of 65 bits, including all the Class 1 a bits are protected by means of a cyclic redun- 
dancy check (CRC). This is used in order to test for channel conditions. If the CRC check on these 65 bits which repre- 
sent the subjectively most important bits detects an error, then the receiver considers the speech frame as having 
suffered an unacceptable degradation in audio quality and consequently drops the frame. In order to maintain a com- 

io parable speech quality with that offered by OSM circuit-switched services, it is necessary to maintain the same level of 
channel protection on these bits, because otherwise there would result an increase in the frame dropping rate, with a 
corresponding decrease in the ensuing speech quality, if the current GPRS coding schemes are to be used, the only 
scheme which would provide this level of protection is CS-1 . This would however protect all bits indiscriminately to the 
same extent without taking into consideration the subjective weighting of the source-coded bits. Using the CS-i scheme 

is would leave a 1 81 -bit payload, which corresponds to 9.05 kbit/s However this figure does not take into consideration the 
headers belonging to the RLC/MAC layer, which as can be seen in Figure 10, occupy 21 bits. This further reduces the 
information payload to 160 bits, corresponding to a data throughput of 8 kbit/s Adding the LLC and SNDCP headers 
, into the information payload, further reduces the user data throughput to about 5.6 kbit/s, depending on what function- 
ality is included in the headers of those layers. 

20 [0057] A far more attractive solution to implementing channel coding within the Voice over GPRS system is to use a 
channel coding scheme tailored around the requirements of the particular speech coder being used. This allows for the 
greatest efficiency by only offering powerful protection to the subjectively most important bits, while offering different 
levels of channel coding to the remaining bits. This scheme also allows for the headers belonging to the RLC/MAC LLC 
and SNDCP layers to be powerfully protected. The present solution is to use the CS-i coding scheme to protect those 

25 radio block data payload bit positions which are occupied by these higher level headers. 

[0058] Current coding schemes specify the use of a CRC-based Block Check Sequence (BCS) to detect errors within 
a radio block. If any such errors are detected, the RLC layer is informed, and retransmission requested, if operating in 
acknowledged mode. The BCS is also used by the base station subsystem to monitor channel conditions between the 
mobile stations and the base station. In all coding schemes except CS-l, the length of the BCS is sixteen bits and it 

30 operates over the entire length of the radio block. As retransmission is not used in the present voice system, it suffices 
to be able to detect errors over the section of the data payload used by the protocol headers only. This allows for a 
reduced-length sequence of 8 bits to be used. 

[0059] Figure 10 shows the GPRS system using the enhancements to the protocols as described above. At the 
SNDCP layer, header compression is carried out on the joint RTP/UDP/iP headers to give a compressed SNDCP 
35 header which is only 24 bits long. The SNDC-PDU is then reorganised in such a way that the header is segmented into 
two sections of differing length, so as to allow for an identical payload format in both transmitted radio blocks. This 
means that while 8 bits of the SNDCP header are retained at the beginning of the packet, the remaining 16 header bits 
are placed towards the middle of the packet. 

[0060] The SNDCP-packet is then encapsulated into an LLC frame with the addition of the new, truncated 8-bit LLC 
_headejxTha.RLGMAG blocks are formed by segmenting the received LLC packet into two. In order toJmplement the 
asymmetric buffering described above, the first produced speech frame is forwarded down to the RLC/MAC layer, 
together with 16 bits of combined SNDCP and LLC headers for immediate forwarding over the radio channel. This is 
followed 20 ms later by the second segment of the LLC frame, this time containing the remaining 16 bits of the SNDCP 
header and the contents of the second speech frame. This system ensures that both RLC/MAC blocks have exactly the 

45 same layout. At the RLC/MAC layer, a further 21 bits of header are added to both blocks together with a 3 bit USF and 
a short 8 bit BCS. Each block is then channel coded using a new coding scheme optimised for speech, which is referred 
to as CSS-1 (Coding Scheme-Speech I). Although each speech coding implementation uses a different channel coding 
scheme, it is assumed that the header information is always powerfully protected using a half-rate code. This leaves 
360 bits as a gross speech payload, which is used to contain both the speech information as well as the portion of chan- 

50 nel coding for speech only. This translates to a data throughput of 1 8 kbit/s, which is a considerable increase on the 5.6 
kbit/s available using the current standards, even when taking into consideration the throughput in the present scheme 
that has to be used for channel coding. 

[0061] Using the GSM EFR speech coder within the 360 bit payload available in GPRS blocks, each EFR speech 
frame is 244 bits long, so that 116 bits can be used for channel coding, which represents a throughput of 5.8 kbit/s. 
55 [0062] The current system used for implementing speech services in GSM by means of the EFR coder allows for 212 
bits for every speech frame. 

[0063] Conventionally, EFR speech frames undergo two stages of speech coding, bringing the gross payload from 
244 bits to 260 bits and 456 bits respectively. The preliminary coding stage allows for the duplication of the most signif- 
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icant Class 2 bits. As this feature is not made use of in the present example so these bits are discarded. The 8-bit CRC 
will however be retained. The resulting frame structure prior to convolutional coding is therefore is shown in Figure 11. 
[0064] The nomenclature of the classes has been changed so as to remove ambiguity. Whereas both Class A and 
Class B bits are protected by convolutional coding, the Class A bits enjoy additional protection provided by the EFR's 
8-bit CRC. The Class C bits are unprotected and are equivalent to the GSM Class 2 bits. The frame size is reduced from 
260 bits to 252 bits by discarding the repeated bits. The basic channel protection scheme for this speech frame struc- 
ture involves applying the convolutional code specified in [GSM 05.03] to the bits of Classes A and B, while leaving the 
Class C bits unprotected. The FR's 3-bit CRC is not implemented. Figure 1 1 also outlines the two ways of reducing the 
gross speech and channel coding payload to 360 bits, from the current requirement of 442 bits. Note that the difference 
between this value and the block size of 456 is equivalent to discarding the 3-bit CRC and the repetition bits. 
[0065] The first scheme involves puncturing the Class B bits. Class A bits are not punctured so as not to affect the 
frame dropping rate which is determined by the 8-bit CRC operating upon them. The Class B bits are punctured using 
a uniform puncturing mechanism in which one every k coded bits are removed at the source coder. The resulting reduc- 
tion in required throughput is shown in the following table: 



puncturing depth k 


code rate 


payload 


required bit rate 


3 


0.751 


362 


18100 


4 


0.666 


382 


13100 


5 


0.623 


394 


19700 


6 


0.600 


402 


20100 


7 


0.584 


407 


20350 


8 


0.570 


412 


20600 




0.500 


442 


22100 



[0066] From the results shown in the table, it can be seen that the required bit rate of 18.0 kbit/s can be practically 
achieved by puncturing the Class B bits only. 

[0067] Another method by which the gross throughput required to transport the speech may be reduced is by varying 
the position of the threshold T as shown in Figure 1 1 . In this way, the number of bits falling into Classes A and B may 
be reduced, thereby increasing the number of unprotected bits. The variation in required payload as a function of the 
number of Class B is shown in the following table. 



number of class A & B 
bits 


gross pay load 


Tetpirernhroughput Kb/s 


182(EFR) 


442 


22.1 


160 


420 


21.0 


140 


400 


20.0 


120 


380 


19.0 


100 


360 


18.0 


80 


340 


17.0 



[0068] Examining the effects of puncturing shows that the distortion seems to be negligible at puncturing depths of, 
and in excess of 5. It was therefore decided to use this depth and a depth of 4, as a basis for reducing the number of 
bits required by channel coding. The position of the threshold was then determined by ensuring that the total number 
of channel-coded bits do not exceed 360 bits. 
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Claims 

1 . A mobile terminal for communicating with a base station in a packet radio services network, said terminal including 
a processor for determining one of a plurality of channels for communication between the mobile terminal and the 
base station; for digitally coding speech to provide speech information; and for assembling speech information into 
speech packets; for generating channel allocation requests in which to send speech packets; a radio transmitter for 
transmitting the requests and the packets to a base station in the network; and a radio receiver for receiving iden- 
tities of channels allocated by the base station for the mobile terminal to transmit on, said processor being respon- 
sive to each received channel allocation to determine that packets are sent on the allocated channel. 

2. A mobile terminal as claimed in claim 1 , wherein if a request to send is not granted, the processor is arranged to 
discard speech information until a further request is granted. 

3. A mobile terminal as claimed in claim 2, wherein the processor is arranged so that when a request to send is not 
granted, a further request is delayed by a predetermined period. 

4. A mobile terminal as claimed in claim 3, wherein the delay is increased if successive requests are not granted. 

5. A mobile terminal as claimed in claim 4, wherein following a predetermined maximum delay, the delay is reduced. 

6. A mobile terminal as claimed in any preceding claim, wherein the processor is arranged to implement a layered pro- 
tocol, and wherein each packet is given a network and transport layer header in a subnetwork dependent conver- 
gence protocol layer (SDNDCP). 

7. A mobile terminal as claimed in claim 6, wherein the header is a RTP/VDP/JP header. 

8. A mobile terminal as claimed in claim 6 or 7, including a voice activity detector, and wherein the processor is 
responsive to detection of voice activity by the voice activity detector, to generate a request for a channel allocation 
in which to send voice packets, and on receipt of a channel identity, to send am address header uncompressed on 
that channel once and subsequently to send packets with compressed headers which do not contain the destina- 
tion address on the identified channel, until the voice activity detector detects no voice activity. 

9. A mobile terminal as claimed in claim 7 or 8, wherein the processor is arranged to construct packets of an equal 
number n of frames, the processor being further arranged to implement a logical link layer protocol (LLC) which 
adds its own LLC header information comprising a service access point identifier defining speech service to each 
packet, and to divide the total LLC plus SNDCP header into n parts of equal length and to place one header part 
before each frame in the packet. 

10. A mobile terminal as claimed in claim 9, wherein in the physical layer, in each frame, the header and the most 
important .bfe^p^ecb.. information are coded using_a Gonvoiutionai code, and .a .subset _of important bits of the. 

speech information are coded using a cyclic redundancy check. 

11 . A base station operating in a packet radio services network in communication with a mobile terminal as claimed in 
any preceding claim, said base station including a radio receiver for receiving requests from mobile stations to send 
data packets and requests to send speech packets and operable on a plurality of channels to receive data packets 
and speech packets; a processor for reserving a predetermined number of said channels for receiving coded 
speech packets, and for allocating a free one of said predetermined number responsive to a channel allocation 
request to send a speech packet; and a transmitter for transmitting the allocated channel identity to the mobile sta- 
tion. 

12. A base station operating in a packet radio services network in communication with a mobile terminal as claimed in 
any of claims 1 to 10, the base station including a radio receiver for receiving requests from mobile stations to send 
data packets and requests to send speech packets and operable on a plurality of channels to receive data packets 
and speech packets; a processor for nominating channels for a mobile station to send speech packets and for 
processing packets in a talk spurt comprising a single destination address header followed by a plurality of packets 
not containing a destination address, for transmission over the network. 

13. A base station operating in a packet radio services network in communication with a mobile terminal as claimed in 
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any of claims 1 to 10, the base station including a radio receiver for receiving requests from mobile stations to send 
data packets and requests to send speech packets and operable on a plurality of channels to receive data packets 
and speech packets; a processor for implementing a protocol which recovers network and transport layer headers 
and logical link layer headers for a packet, from equal parts of each frame in the packet. 

14. A base station as claimed in claim 13, wherein the processor is operative in each frame to correct errors in the 
header and the most subjectively important bits of the speech information part only. 
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